Skip to main content

Documentation Index

Fetch the complete documentation index at: https://superdoc-dependabot-npm_and_yarn-npm_and_yarn-e04d5d616f.mintlify.app/llms.txt

Use this file to discover all available pages before exploring further.

Supported Input Formats

Currently, SuperDoc API supports DOCX (Microsoft Word) files as input:

DOCX (Microsoft Word)

.docx - Office Open XML format - Full formatting support - Maximum file size: 25MB (authenticated) / 5MB (public) - Supports all modern Word features

DOCX Compatibility

SuperDoc is compatible with DOCX files created by:
  • Microsoft Word (2007 and later)
  • Google Docs (exported as DOCX)
  • LibreOffice Writer (saved as DOCX)
  • Apple Pages (exported as DOCX)
  • Other Office suites that support OOXML
Looking for support for other input formats? Contact us to discuss your needs.

Supported Output Formats

SuperDoc converts documents to PDF format:

PDF (Portable Document Format)

.pdf - Adobe PDF format - High-fidelity conversion - Preserves formatting and layout - Universal compatibility - Optimized for web and print

PDF Features

Our PDF output includes:

Text Preservation

  • All text remains selectable - Fonts embedded or substituted - Character encoding preserved

Layout Fidelity

  • Page dimensions maintained - Margins and spacing preserved - Column layouts intact

Interactive Elements

  • Clickable hyperlinks - Bookmarks for navigation - Form fields (when applicable)

Metadata

  • Document properties - Creation timestamps - Author information

File Size Considerations

Input File Limits

Endpoint TypeMaximum SizeRecommended Size
Authenticated25MB< 10MB
Public5MB< 2MB

Output File Optimization

SuperDoc automatically optimizes PDF output:
  • Image compression - Reduces file size while maintaining quality
  • Font subsetting - Includes only used characters
  • Metadata cleanup - Removes unnecessary data
  • Structure optimization - Efficient PDF organization
Large input files result in proportionally large PDF outputs. Consider compressing images in your DOCX before conversion.

Format-Specific Features

DOCX Input Features

  • Rich text formatting (bold, italic, underline, etc.)
  • Multiple font families and sizes
  • Text colors and highlighting
  • Paragraph alignment and spacing
  • Bullet points and numbered lists
  • Indentation and hanging indents
  • Multiple sections with different formatting - Headers and footers (including different first page) - Page numbers and fields - Table of contents - Footnotes and endnotes - Page and section breaks
  • Complex table structures - Cell formatting and borders - Table styles and themes - Merged cells and nested tables - Images and embedded objects - Charts and SmartArt graphics
  • Hyperlinks (internal and external)
  • Cross-references
  • Comments and track changes*
  • Form fields and controls
  • Text boxes and shapes
  • Watermarks and backgrounds
*Track changes are flattened (accepted changes only appear in PDF)

PDF Output Characteristics

  • PDF 1.4 compatibility - Ensures maximum compatibility
  • PDF/A compliance available on request
  • Searchable text - All text remains selectable
  • Vector graphics - Scalable elements preserved
  • No password protection by default - Print permissions enabled - Copy permissions enabled - Modification restrictions available on request
  • Tagged PDF structure
  • Alternative text for images (when present in source)
  • Reading order preservation
  • Logical document structure

Quality Comparison

Before and After Conversion

ElementDOCX SourcePDF OutputFidelity
Text formatting✅ Rich formatting✅ Preserved99.9%
Page layout✅ Complex layouts✅ Maintained98%
Images✅ High resolution✅ Optimized100%
Tables✅ Complex tables✅ Preserved97%
Hyperlinks✅ Active links✅ Clickable100%
Comments✅ Reviewers’ notes⚠️ FlattenedN/A

Future Format Support

We’re actively working on expanding format support:

Coming Soon

  • RTF (Rich Text Format) - ODT (OpenDocument Text) - HTML to PDF conversion

Under Consideration

  • PowerPoint to PDF - Excel to PDF - Markdown to PDF
Have a specific format requirement? Let us know and we’ll consider adding it to our roadmap.

Best Practices by Format

Optimizing DOCX for Conversion

  1. Use standard fonts - Arial, Times New Roman, Calibri work best
  2. Embed custom fonts - Ensure fonts are included in the DOCX file
  3. Optimize images - Use appropriate resolution (300 DPI for print, 150 DPI for web)
  4. Simplify layouts - Avoid overly complex table structures
  5. Test compatibility - Preview in different Word versions

PDF Output Optimization

  1. Consider end use - Web viewing vs. print requirements
  2. Monitor file size - Balance quality with download speed
  3. Verify links - Test hyperlinks in the output PDF
  4. Check accessibility - Ensure proper reading order
  5. Validate structure - Confirm all elements converted correctly

Technical Specifications

DOCX Processing

  • Engine: LibreOffice-based conversion
  • Memory limit: 1GB per conversion
  • Timeout: 60 seconds maximum
  • Concurrent processing: Load-balanced across multiple servers

PDF Generation

  • Renderer: Gotenberg PDF engine
  • Compression: Automatic optimization
  • Color space: RGB (can convert to CMYK on request)
  • Resolution: Maintains source image DPI up to 300 DPI